Markov decision process - PDFSEARCH.IO - Document Search Engine

Markov decision process
Results: 537

#	Item
481	Policy-contingent abstraction for robust robot control Joelle Pineau, Geoff Gordon and Sebastian Thrun School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 Add to Reading List Source URL: www.cs.cmu.edu Language: English - Date: 2003-06-04 12:29:33 Stochastic control Control theory Partially observable Markov decision process Reinforcement learning Markov decision process Automated planning and scheduling Action selection Bellman equation Abstraction Statistics Dynamic programming Markov processes
482	Applying Metric-Trees to Belief-Point POMDPs Joelle Pineau, Geoffrey Gordon School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 Add to Reading List Source URL: www.cs.cmu.edu Language: English - Date: 2004-01-17 12:21:11 Dynamic programming Partially observable Markov decision process Stochastic control Search algorithms Vector space K-d tree Norm Metric tree K-nearest neighbor algorithm Algebra Mathematics Linear algebra
483	Distributed Planning in Hierarchical Factored MDPs Carlos Guestrin Computer Science Dept Stanford University [removed] Add to Reading List Source URL: www.cs.cmu.edu Language: English - Date: 2003-06-04 17:02:40 Operations research Functions and mappings Dynamic programming Markov decision process Reinforcement learning Dantzig–Wolfe decomposition Automated planning and scheduling Functional decomposition Linear programming Mathematics Applied mathematics Mathematical optimization
484	EXECUTION-TIME COMMUNICATION DECISIONS FOR COORDINATION OF MULTI-AGENT TEAMS Maayan Roth CMU-RI-TR-08-04 Add to Reading List Source URL: www.ri.cmu.edu Language: English - Date: 2012-08-21 12:43:37 Personal Jukebox Partially observable Markov decision process Agent-based model Ace Actor model Control theory Multi-agent systems Statistics Computing
485	Applying Reinforcement Learning to Obstacle Avoidance Josh Beitelspacher University of Oklahoma, 308 Cate Center Drive Box 5242, Norman, OK[removed]USA [removed] Add to Reading List Source URL: www.netbeetle.com Language: English - Date: 2011-01-10 03:02:38 Neural networks SARSA Q-learning Reinforcement learning Temporal difference learning Backpropagation Markov decision process Algorithm Machine learning Statistics Computational neuroscience
486	Fast approximate planning in POMDPs Geoff Gordon [removed] Joelle Pineau, Geoff Gordon, Sebastian Thrun. Point-based Add to Reading List Source URL: www.cs.cmu.edu Language: English - Date: 2003-05-01 11:32:55 Pointwise product Mathematics Dynamic programming Partially observable Markov decision process Stochastic control
487	Tree Based Hierarchical Reinforcement Learning William T. B. Uther August 2002 CMU-CS[removed] Add to Reading List Source URL: reports-archive.adm.cs.cmu.edu Language: English - Date: 2003-03-11 11:32:52 Mathematical logic Theoretical computer science Markov decision process Reinforcement learning Abstraction Problem solving Statistics Mind Algorithm
488	Policy Gradient vs. Value Function Approximation: A Reinforcement Learning Shootout Technical Report No. CS-TR[removed]February 2006 Josh Beitelspacher, Jason Fager, Greg Henriques, and Amy McGovern School of Computer Sci Add to Reading List Source URL: www.netbeetle.com Language: English - Date: 2011-01-10 03:02:38 Neural networks Computational neuroscience SARSA Reinforcement learning Q-learning Computational statistics Temporal difference learning Artificial neural network Markov decision process Machine learning Statistics Mathematics
489	Algorithms for Inverse Reinfor ement Learning Andrew Y. Ng Stuart Russell ang s.berkeley.edu russell s.berkeley.edu Add to Reading List Source URL: www.cs.berkeley.edu Language: English - Date: 2007-04-04 16:42:02 Applied mathematics Stochastic control Markov processes Operations research Mathematical optimization Linear programming Partially observable Markov decision process S0 Bellman equation Dynamic programming Statistics Mathematics
490	Trial-based Heuristic Tree Search for Finite Horizon MDPs Add to Reading List Source URL: www2.informatik.uni-freiburg.de Language: English - Date: 2013-04-05 08:41:03 Heuristics Search algorithms Mathematical optimization Heuristic function Reinforcement learning Greedy algorithm Markov decision process Algorithm Dynamic programming Mathematics Applied mathematics Statistics